Robust Chart Parsing with Mildly Inconsistent Feature Structures

نویسندگان

  • Carl Vogel
  • Robin Cooper
چکیده

We introduce the formal underpinnings of our theory of non-classical feature structures. The resulting expanded universe of feature structures has direct impfications for robust parsing for linguistic theories founded upon feature theory. We present an implementation of a robust chart parser for Head-driven Phrase Structure Grammar (HPSG). The problem of relaxed unification is in limiting it so that arbitrarily nongrammatical inputs needn’t be accepted. In the worst case, excessively ’relaxed’ parsers cannot provide meaningful interpretations to process. However, parsers which can guarantee minimally nongrammatical (inconsistent) interpretations provide an important tool for grammar development and online robust processing of natural language. Our parser prefers maximally consistent interpretations, and accommodates inconsistencies by discovering the minimal set of constraints that need to be relaxed for a parse to go through, without employing backtracking or post processing. The system is different from other related relaxational techniques for unification grammars which require advanced naming of features whose constraints are allowed to be relaxed. Yet it is compatible with those approaches in that there is a well defined location for preferences on sources of inconsistency to be named, as well as for resolution of inconsistent information. We use a simple approach to the problem of unknown words and suggest generalization of that for coping with missing and extra elements in an input.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Treebank Grammar Techniques for Non-Projective Dependency Parsing

An open problem in dependency parsing is the accurate and efficient treatment of non-projective structures. We propose to attack this problem using chart-parsing algorithms developed for mildly contextsensitive grammar formalisms. In this paper, we provide two key tools for this approach. First, we show how to reduce nonprojective dependency parsing to parsing with Linear Context-Free Rewriting...

متن کامل

A Model for Robust Chinese Parser

The Chinese language has many special characteristics which are substantially different from western languages, causing conventional methods of language processing to fail on Chinese. For example, Chinese sentences are composed of strings of characters without word boundaries that are marked by spaces. Therefore, word segmentation and unknown word identification techniques must be used in order...

متن کامل

A Feature Structure Algebra for FTAG

FTAG, the extension of TAG with feature structures, lags behind other featurebased grammar formalisms in the availability of efficient chart parsers. This is in part because of the complex interaction of adjunction and unification, which makes such parsers inconvenient to implement. We present a novel, simple algebra for feature structures and show how FTAG can be encoded as an Interpreted Regu...

متن کامل

Address Recognition with Robust NLU Technology

An analysis component recognizing addresses on printed documents and identifying the respective sender or recipient is presented. The component’s kernel is a standard chart parser for feature-based context-free grammars. The parsing procedure is enhanced with the island-parsing strategy and the ability for partial parsing. The needs of the realworld application within document analysis of print...

متن کامل

Parsing with Typed Feature Structures

In this paper we provide for parsing with respect to grammars expressed in a general TFS-based formalism, a restriction of ALE. Our motivation being the design of an abstract (WAM-like) machine for the formalism, we consider parsing as a computational process and use it as an operational semantics to guide the design of the control structures for the abstract machine. We emphasize the notion of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002